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(54) Automatically scrubbing ECO errors in memory via hardware 



(57) The present invention provides a method and apparatus for automatically scrubbing EGG errors in 
memory upon the detection of a correctable error in data read from memory. This is performed by providing in 
a memory controller memory control logic for controlling accesses to memory, an ECC error checking and 
correcting unit for checking data read from memory for errors and correcting any correctable errors found in 
the read data, a first data buffer for storing the corrected read data output from the ECC error checking and 
correcting unit and a writeback path having an Input end coupled to an output of the first data buffer and an 
output end coupled to memory. Upon the detection of a correctable error in data read from a particular 
memory location, the ECC error checking and correcting unit signals to the memory control logic the existence 
of a correctable error in the read data. The memory control logic then obtains exclusive control over the first 
data buffer and the writeback path to control writing of the corrected read data onto the writeback path and 
subsequently to memory. 
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At least one drawing originally filed was informal and the print reproduced here is taken from a later filed formal copy. 
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METHOD AND APPARATUS * FOR 'AUTfHATICALLY SCRUBBING ECC 
ERRORS IN MEMORY VIA HARDWARf 

Ft AriCCROTTNJn OF THF TNVENTION 
pFT n OFTHF TMVFNTTION: 

This invention relates to data communications in a computer system, and 
5 more specifically, to the detection and correction of correctable errors in data 
transmitted between a microprocessor and a memory subsystem. 

To maintain data integrity, most computer systems on the market today 
utilize error correction code (ECC) schemes implemented within ECC circuitry of a 

10 memory controller to detect, and in some cases, to correct errors detected in data 
read from memory. In computer systems that utilize ECC schemes, data to be 
written to memory is first encoded into a code word by appending the data with ECC 
parity bits derived by calculating the scalar product between the data and the 
generator matrix of an established ECC code. Upon reading the code word out of 

15 memory, it is decoded using a transform of the generator matrix to produce the 

original data in addition to a syndrome which can be used to detect and classify any 
occurring errors. 

If no errors have been detected, the data read is placed on the system bus for 
subsequent use by the microprocessor or other bus agent. If an uncorrectable error is 
20 detected, the data is again placed on the system bus while the error condition is 

reported to the microprocessor. If, however, a correctable error is detected, the code 
word is first input into a correcting circuit to correct the data which is then output 
onto the system bus. (It is noted that if the computer bus is of a pipelined nature. 
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^ ^r^r — used » .nurface U« sys.» .us » memory wo^d .te,y 

passes between the system bus and memor)-.) 

^ .b^ve^escHbed pocess effectivdy « system reUsbUity since aU 

data entering *e system Is checked for ECC errors before it is uBlized. However, 
r:Te,«JmostECCsche.nescanor.yco^.s^eanddo„biebiterr„rs 

w tbin a byte, the occurrence o, multiple errors in . ideation m memo^ . 

.0 cause the data at that iocaUon to be uncorrectabie, .^y s^^'^r^^ 
2^anyopera.ionsw.chneedU,a,data..c.rd..,y.s,s.mde.s^^^ 
, aevelopedsoftwareWHCCscrubbingprocesseswhichcorrecterrormdata 

retrieved from memory write the corrected data back. 

^ a firs. convenUona, soft.-are-based scr.*bing process, the detection of a 
.„.ctab,e error in data being read from memory cau^ microcode to sa« *e 
to^cal address of the erred memory iocation and generate a system cail to 
. :Cri».tine. The invoked interrupt routine then uses the iogicai address to 
.e phystca, address of the memory ioeation and to re-read the data .om 
t specified memory iocation M the data is again read from memory, .s mpu, to 
^^icheckingandcorrecdngc^^t tocorrec, ^e data. After the data has been 
:^ed,««i:.erruptservicerout,„e issues— nstothemicroproc^or^^ 

^ cause *e data to be written back to the same iocation in memory once the data has 
been piaced on the system bus and the appropriate requests are made to the 
microprocessor. 

Obviously, a major drawback with such a process is that it requires a 
substantial amount of .me in addition to a significant amount of sofn^are suppo« 
^ in order to perform even minor scrubbing operations. Because titis process entatis 
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completely separate process from that previously used to check and correct errors in 
the data when it is initially read from memory, it requires that a relatively complex 
interrupt routine be programmed in macrocode for re-calculating the physical 
address of the memory location and issuing the read and write commands which 
5 control the saubbing operation. Hence, such a soft^vare-based process does not only 
require significant design modifications to the entire system, but it also reduces 
system performance due to the fact that the routine must take the time to re- 
calculate the proper address, generate the proper commands and duplicate many of 
the steps which were already performed during the initial memory read. 

10 Alternately, in another conventional, software-based scrubbing process, 

software is used to periodically (but continuously) scan through memory to check 
.. • for and correct errors that it finds. This is accomplished by sequentially reading each 
memory location, checking the data for errors, correcting any correctable errors that 
- • are detected and writing the corrected data back to the same location. Data that is not 
15 ■ in error is simply discarded after being checked. Yet, since this is similar to the 
. above-described mechanism, this process also suffers from the amount of time and 
■ software support needed to perform the operation. Furthermore, this process also 
utilizes a significant amount of memory bandwidth, thereby preventing the most 
effective utiUzation of the memory bandwidth and impacting system performance. 
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MiiniiMvnrTHFi'^'^'^™ 

Acccdingly, « is an ob,.c, of tt.e p.-. — .o provide a me^.^ ^ 
apparatl^a!— s.u...sHCC..o.U..e»o,upon.Keae.ec.o„o. 

5 i„,p,«nen»d mechanism fta. is »»par«...o software. 

.s ano««, obiec. o, the presen, invention .0 provide a me.»»d „d 

^paranrs .or a„«,ma«ca„y scrubbing BCC errors in memory vU a v,rr.eb.* p m 
:!;:d:e.een*eoo.p,.o,*.reada„d„ri.eda,abu,.rsupon*ede^.»no. 

correclable errors in data read Irom memory. 

I, U a further object of the present invenUon to provide a method and 
apparl for automaticaiiy scrubbins BCC ^rs in memory w«ch subs«nba^ 
Zlses system performance by avoiding the use of cosUy interrupt roubne^^ and 
:::^o.yaslaU fraction Of .he memory band„idthtocorreCspe..cai.y. 

identified errors the memory read data. 

„ is yet another obiec, of the present invention to provide a method and 
appara^s fir automaticaUy scrubbing BCC errors in memory wiach mcrea^s system 
«byimmediate,ycorrechnssing,ebi.ern>rsinmemorydatabeforeasecond 

single bit error to the same location causes a lo. of the memory data. 

The present invention provides an efficient and Uansparent scrubbing 
.0 mechanism within a memory — for „ri«ng data that has been corrected b^ 
KC hardware bac. into memory, thus scrubbing the memory iocalon „ error. The 
Lmory controller in wMch the scrubbing mechanism U Implemented comprrses at 
L. m mory a.n«>l logic for controlling the How of daU to and from memory m 
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addition to a single data buffer, although in the preferred embodiment, separate read 
and write data biiffers are provided. 

Data read from memory is checked for errors by the ECC hardware and stored 
. in the read data buffer. If the memory controller detects a correctable ECC error in 
5 the read data, it corrects the data as it is being written into the read buffer. As is 
normally done, the read data is subsequently placed on a system bus for use by a 
microprocessor or other bus agents coupled thereto. However, to scrub the erred 
data in memory, a write back path is provided from the output of the read data 
buffer to the output of the write data buffer. 

10 When ready, the memory control logic issues a memory scrub command to a 

datapath state machine, which controls the flow of data to and from memory, to 
signal that the corrected data within the read buffer is to be written back to the 
memory location from which it came. Upon decoding the command, the state 
machine first issues a read source select signal to a read data buffer control 

15 multiplexor to select the control source which will issue commands to access the 
particular buffer and initiate a read operation therefrom. The state machine also 
issues a buffer select signal to an output multiplexor which has as inputs both the 
write data and the read data from the respective buffers and has its output coupled 
to memory via memory input/output buffers and memory interface components. 

20 Once this has been done, the selected control source then asserts a read strobe 

to the read data buffer to read the data out of the buffer. The data passes through the 
output multiplexor and into an ECC code word generation unit where a 
corresponding code word is produced and stored within the memory input/output 
buffers. The control logic then issues the appropriate address and request attribute 

25 signals to the memory interface components, which in turn generate the 
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BAS, CAS. s.,n.s so a., *e a.U . « 

,«ato. in memory sp«iB.d by crigtaal read request 

According .o a flrs. al.er„,.e embodirr^n.. *is mechanism can aUo be used 
« perfcrm periodic software-based scrubbing of .he memory arxay. Smce -y 
S To^Jin L is .u«,mauc.,y scrubbed upon correcUon of erred read dau. e» 
' rrm:m:ry.anbescrubbedbyse,uen.ai.yread.geacb.oca.o„inmemory. 

According .0 a second ,.,er™.e embodimeni. ^ same mechanism can be 
. „,w ECC cod. word when a read-modifywrite operation is 
::o:rr::S:-ynrs..ori„g.hepartiaiwri.eda«.«»^»^^^^ 
„ ^^erand^nreadingU^daUiobemodifiedfrommemoryandsf^^m^ 

„ad da,a buffer. Wim Ure appropriate con.ro, signals being .ssued from .he oo«r 
ibom^^readandwriiedauis merged in *eou.pu.multipie«or as ...bemg 

written to memory. 
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BRTFF DESCRIPTION OF T HE DRAWTNC S 

Figure 1 is a block diagram of a generalized computer system comprising a 
plurality of microprocessors and a memory subsystem in which a memory 
controller is coupled to a memory array via memory interface components. 

5 Figure 2 is a block diagram of the memory controller shown in Fig. 1 in 

accordance with the present invention. 

Figure 3 is a block diagram of the specific components of the memory 
controller and their interconnections which form the automatic scrubbing 
mechanism of the present invention. 
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nrrffl FT pf^-i^qm of invfntion 

n« pr«en, invendon provide a method and apparatus for automatically 
^rubbtag BCC errors in memory upon detecting correctable errors in data read tarn 
memory through the use of a writeback path coupled between the outputs of the 
read and write dau buffers of a memory conUoUer. For purposes of explanabon, 
.pacific embodiments are set forth in detail to provide a thon,ugh understanding of 
,Ke present invendon. However, it wiU be apparent to one skilled In the art .ha. tta 
present invention may be practiced with other embodiments and wi*ou. 11 the 
specific details sc. forth. In other ins.ances, well known archi««ur.l elemenU. 
devices, circuits, process steps and the like are no. se, forth in detail in order to . 
avoid unnecessarily obscuring tt«e present invention. 



1 n-,-m-, in TrF TP''"™"''^^"NTROLLEB 

to accordance wi,h one embodiment of .he presen. invention, .he scrubbing 

15 mechanUm is implemented in a pipeUned, rnul.iprocessor computer sysiem. 
generally shown in Fig. 1. which comprises up to 4 miaoprocessors, a system bus 
and a memory subsystem which preferably includes a DRAM memory array, a 
memory controller (MC) for handling memory access requests and memory 
interface components (NDCs) for interfacing the MC to to memory array by 

20 multiplexing r«d data from and buffering write data to the interleaved memory. 

The system bus provides 36 biu of address, 64 bits of data and the control 
signaU needed to support daU integrity, to perform bus transactions and to maintain 
a coh^en. shared memory in .he presence of multiple caches. The system bus can 
support eight physical loads at 66.7 MHz (and more at lower frequencies) so that an 
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additional I/O bus bridge, MC, or other custom attachment may be connected to the 
bus. 

With regard to the memory array, three different types of memory 
configurations are supported: A 4:1 interleaved DRAM configuration with up to 4 
5 GBytes of memory using 64 Mbit technology, a 2:1 interleaved DRAM configuration 
with up to 2 GBytes of memory and a non-interleaved DRAM configuration with 
up to 1 GByte of memory. In this particular embodiment, up to four MCs can be 
coupled to the system bus so that a fully configured system would have a maximum 
of 16 GBytes of addressable memory. 

10 The memory supported by the MC is preferably arranged as 8 rows with 1, 2 or 

4 interleaves. Although any type of RAM devices may be implemented for the 
memory, those preferably used comprise either discrete memory devices, single- 
sided SIMMs, or double-sided SIMMs. For all three memory configuration types, 
the MC provides 8 logical RAS# signals, one per row, while two copies of the RAS# 

15 signals are provided for fanout. For implementations that use discrete memory 

devices or single-sided SIMMs, the MC provides 8 logical CAS# signals, one per row, 
while two copies are again provided for fanout. In the case of double-sided SIMMs, 
however, there are only 4 logical CAS# signals (one per pair of rows), and the 
loading per CAS# signal is doubled. 

20 As shown in Figs. 1 and 2, the MC is divided into two ASICs, one forming a 

DRAM Controller (DC) component and the other forming a Data Path (DP) 
component, thereby separating the address and data flows through the MC so as to 
improve the speed and efficiency of the pipelined system. The DC component is 
responsible for handling all memory access requests and commands in addition to 
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for the flow of dau to and from memory. 

The compo,»nts are fupcUomUy coupled together via .n interchip 
interconnect OCIC) which comprise, the communicaHons paths an. b>^ers n^- 

5 «> transmit dau and control informabon between the two componenU. Both ASICs 
ate designed to connect direcUy to the system bus without external glue 
exponents, a.^ together co.btute orfy one bus load. It is noted that ald»ugh m 
ftis particular embodiment the MC is preferably divided into separate ASIC 
components for purposes of improving efficiency, speed and cos, effectiveness, .t 

.0 canbe implemented as a single package, memo^ controUer, and can further be 
„mi^ in both multiprocessor and single processor as well as pipeUned and non- 
pipelined systems. 

Keferring to F,g. the DC component of the MC is functionally divided in«. 
sue. main sectlo,.: a Bus Engine (BEC), a Queue Management Unit (QMUC) and a 
15 Me^r^ry Interface (MMIC). The BEC provides the interface to .he system bus. wlule 
*e QMUC couples the BEC to the MMIC The BEC contains an in-order-queue for 
system bus requests in addition to the state machines (not shown) needed to control 
O^abus and to orchestrate dautrar^fersbetweenthebus and ttaMCdaU buffers 

shown). The BEC is also responsible for buffer allocadon in the DP component 
20 and for control of the sys.em.ide interface (SYSmP) of the IQC locat«i witWn ^ 
PP component. Th. QMUC is me primary interface between the BEC and the MMIC 
and comprises an outbound reques. queue (ORQ) for buffering system bus requests 
ami corresponding addresses recdved from the BEC for use by the MMIC m . 
decoding requests and performing memory accesses. . 
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The MMIC provides all of the memory controller specific functionality of the 
'MC The primary functions of the MMIC is to control the MMIC side of the QMUC, 
to control the external memory and to control the interchip interface to the DP 
. component. This unit also converts system requests into both DRAM memory 
5 cycles and interchip cycles. With regard to the interface between the MMIC and the 
ORQ of the QMUC, the MMIC uses the ORQ 1) to provide the effective address of 
the current request to the ORQ, 2) to read the next request from the ORQ and 3) to 
read the status of the next request. In this respect, the MMIC also provides response 
iitformation back to the BEC via the QMUC. 

10 The main units of the MMIC preferably consist of 1) an address decode unit 

, (not shov^n) for determining if the registered bus request is destined for the MC and 
for computing an effective address for accesses to the DRAM array, 2) a request 
"address unit (not shown) for determining where in the memory array the request is 
destined and for maintaining a table of open pages in the memory array, 3) memory 
15 - control logic (MCL) for controlling the other units based on the current request from 
• the QMUC and for generating the memory subsystem control signals, the memory- 
. side interchip interconnect signals, the MIC interface signals, and the memory reset^ 
and error signals, 4) a configuration register unit (not shown) which contains all the 
configuration registers/logic and state machines required to precalculate internal 
20 parameters and control the reading and writing of configuration space via the ICIC, 
and 5) a memory refresh unit (not shown) for controlling the memory refresh rate. 

On the datapath side, the DP component of the MC supports the data flow 
from the system bus to the memory array and is disassociated from the memory 
control interfaces since all controls for the bus and the memory interface are 
25 generated by the DC component. The main functions of the DP component are to 
provide a 64 bit datapath from the bus to memory, to provide ECC support for data 
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both on the bus and in memory, to scrub (or correct) any correctable errors in the 
memory array and to provide support for single dock cycle data transfer bursts. 

Similar to the DC component, the DP is functionally divided into three main 
sections: a Bus Engine (BED), a Queue Management Unit (QMUD) and a Memory 
5 Interface (MMID). The BED provides the interface to the system data bus, while the 
QMUD couples the BED and the MMID through a plurality of data buffers. As 
shown in Fig. 2, the QMUD comprises a set of read data buffers for temporarUy 
. buffering data read from memory before it is placed on the system bus and a set of 

write data buffers for temporarily buffering write data received from the system bus 
10 before it is written to memory. The status of each request is kept track of by the MCL 
of the DC component from which the transaction.commands are generated. To 
maximize system performance in a pipelined system, each set of data buffers 
preferably comprises four buffers such that a totalof eight different memory accesses 
may be in process at any one time. Nonetheless, any number of read and write data 
15 buffers may be used in either a pipelined or non-pipelined system without departing 
from the spirit of the present invention. 

Also shown in Fig. 2, the MMID comprises « plurality of data and control 
interfaces to the QNfUD, the BED, the ICIC and the external memory array. The 
main functional units of the MMID include 1) the memory input/output (MIO) 
20 buffers, which buffer the signals used to access the memory array, 2) the DP 
operation registers, which can be programmed via the DC component for 
operational control of the DP component, 3) the data path unit (DPU), which 
transfers and converts the data from the QMUD buffers to the selected memory 
locations and vice versa, 4) the IQC, which includes the ICIC buffers, the system- 
25 side interface to the DP component (SYSIDP) and the memory-side interface to the 
DP component (MEMIDP) for buffering data and commands received from the MCL 
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of the DC component and converting the commands into data transfer control 
signals, and 5) the datapath state machines (DPSMs)/which provide the control for 
all other blocks based on the current command from the MCL via the ICIC. 

The laC is provided with a system-side interface (SYSIDP) for handling all 
5 commands related to the BED and QMUD data traffic and a memory-side interface 
(MEMIDP) which handles all commands related to the QMUD, the memory data 
traffic and the configuration programming of the DP component. Bus requests that 
target the MC will result in at least one command being generated by the MCL and 
transmitted to the SYSIDP where the commands are processed and converted into 
10 control signals which then direct the flow of data from the BED to the QMUD or 
vice versa. The commands processed by the SYSIDP will also be sent to the 
MEMIDP to iiutiate data transfers to or from memory. 

The DPSMs process memory commands received from the DC component 
and control the flow of data between the QMUD and the memory array. They also 
15 report errors, by type, to the DC component through the ICIC at the end of each ' 
memory command. The DP registers comprise among other things a configurable 
ECC register which is used to enable ECC checking for both system data and memory, 
data. 

With regard to the DPU, shown in Figs. 2 and3, a read data bus and a write 
20 data bus are provided within the DPU for transferring data from memory to the read 
data buffers of the QMUD and from the write data buffers of the QMUD to memory, 
respectively. As shown in Fig, 3, the DPU further comprises an ECC checking and 
correcting unit and an ECC code word generation unit Outbound, write data (i.e. a 
64 bit DWORD in the preferred embodiment) is first popped off a write data buffer 
25 and then input to the ECC code word generation unit where 8 parity bits are 
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calculated and appended to the DWORD to form a 72 bit code word that is 
subsequently written to memory via the MIO buffers. 

Conversely, inbound, read data in the form of a 72 bit code word is first 
buffered in the MIO buffers and then directed into the ECC checking and correcting 
5 unit to decode the code word, check the data for errors and correct any correctable 
errors before it is stored within a read data buffer of the QMUD. In this case, the 72 
bit code word is decoded into the original 64 bit DWORD and is further used to 
calculate a syndrome for the code word. The syndrome is input into an error 
classification circuit (not shown) to determine whether an error has occurred. If an 
10 error did occur, this error condition is reported to the MCL of the DC component v.a 
the Error bus as either a correctable error or an uncorrectable error. If correctable, the 
syndrome for the code word is input into an error correction circuit (not shown) so 
as to produce as output to the read data buffers of the QMUD a corrected DWORD. 
Although any ECC code can conceptuaUy be utilized with the present 
15 invention, the ECC code preferably used in this embodiment of the invention 

comprises a (72. 64) SEC-DED^4ED rotational error control code, this code being 
described in detail in co-pending U.S. Patent Application Serial No. 08/176. 351 
filed on December 30, 1993 and assigned to Intel Corporation of Santa Clara. The 
H matrix for the code is designed to support parity checking over a 72 bit datapath. 
20 All columns of the H matrix are linearly independent and have an odd weight, 
vdth each 4-bit wide column being arranged to allow detection of 4-bit byte errors. 
One and two bit errors are covered by the SEC-DED property, whereas triple errors 
generate an odd syndrome of weight 5 that is not equal to any of the weight 5 
columns in H. Additionally quadruple errors give a non-zero, even weight 
25 syndrome. Accordingly, this particular ECC code can (1) detect and correct single 
bit errors rince each column is unique. (2) detect double bit errors since each 
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column has an odd weight, (3) detect three bit errors within a nibble since such 
errors will result in a syndrome that has one half with weight 2 and one half with 
weight 3, and (4) detect four bit errors within a nibble since such errors will result 
in a s}mdrome that cannot sum to zero. 

5 

IL ATTTOMATIC FCC SCRUBBING MECHANISM 

In accordance with the embodiment of the MC described above, automatic 
memory scrubbing upon memory reads is accomplished by means of a corrected data 
writeback path coupled between the outputs of the QMUD read and write buffers 

10 and under the control of the MCL via the DPSMs. With reference to Fig. 3, once the 
ECC checking and correcting unit has detected a correctable error in a code word read 
from memory/ that unit signals this condition to the MCL via the Error Bus, corrects 
the erred data and then sends it to the next allocated read buffer in the QMUD. 
Under the control of the MMIC, the BED evenhially places the data from the read 

15 buffer onto the system bus for use by the microprocessoris or other bus agents. 

However, a copy of the data is still maintained in the allocated read buffer until the 
corresponding* request in the request queue of the DC component is retired. 

The MCL keeps track of the buffer allocation, and specifically the buffer in 
which the now corrected data is stored, and tags the corresponding bus request in the 

20 request queue with an error tag. Subsequently, the MCL issues a memory scrub 
command to the DPSMs on the MEMCMD bus (via the ICIC) to signal that the 
corrected data within the read buffer is to be written back to the memory location 
from. which it came. Due to the pipelined nature of the MC, the scrub command 
need not be immediately issued since the MCL may choose to perform or complete 

25 higher priority transactions using the other data buffers of the QMUD. However, if 
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„„ly one re,d da« buffer is provided, ften memory soubbir^ would have » be 
pjormed before tt» corrected read da« i. disc^ed from U« buffer, or overv,rrHen 
by subsequent reads. 

Once received by tt» DPSMs, ^ scrub command, which idendHes fte 
5 pardcular buffer in which the correCed d=U is s»red in addidon .o d« iengft of d« 
aa«, is decked and a scrub processing sU« machine pPSM) U sele«ed so as .o 
i^Ua^ s.ub processing. In scrub processing. SPSM firs, issues a read source 
' select signal .0 a read data buffer conttol muldplexor (CU MUX) «> select .he 
control source wWch will issue commands to access the pardcular buffer and iniuate 
^0 a read operation therefrom. Upon assertion of the read source select signal, the 
output of the Cn. MUX is changed from a first ^us agenf con»ol input (via the 
system bus) «. a second MCL control input fr». (via .he MEMCMD bus) so that the 
MCL now conwls *e addressing and strobing of *e read daB buffers. 

Th, SPSM machine also issues a buffer select signal .0 an output multiplexor 
15 (OUT MUX) Which has as input both *e write data and the read data from theu 

respecdve buffers and has its output coupled to memory via the MIO buffers and the 
MICs once Ws has been done. SPSM B>en asser«.a read enable signal to the 
CIL MUX to read the data out of the read data buffer. The data passes through the 
OUT MUX and into the ECC code word g««raaon unit where a code word u 
,0 produced and stored within the MIO buffers of the MMID. The MCL tt»n issues ^ 
appropriate address and requ^t a«ribu.. signals to both *e MlCs and memory 
array via a memory control bus so fta. the corrected data is then wrt«en to the 
locaUon in memory specified by the origin^ read requestsdll presen. in t*. re,ues, 
queue of the DC component. Once the corrected dau has been wrirten back to the 
a same memory location, the tagged request in the request queue of the DC 
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component is retired and the corresponding read data tuffer is deallocated for future 



use. 



In another embodiment of the present invention, the above-described 
saubbing mechanism is used to perform periodic scrubbing of the memory array via 

5 the hardwired writeback path. Since any memory location in error is automatically 
scrubbed upon the correction of erred read data, the entire memory can simply be 
scrubbed by sequentially reading each location in memory. Furthermore, because 
the MCL of the MC generates the control signals appropriate for controlling the 
writeback of corrected data upon detection of correctable errors, the only software 

10 support required from the microprocessor is that needed to set-up the configuration 
registers for this particular operation. 

According to a second alternate embodiment, the same mechanism can be 
used to generate a new ECC code word when a read-modify-write operation is 
performed. This is accompUshed by first storing the partial write data in the write 

15 buffer and then reading the data to be modified from memory and storing it in the 
read data buffer. By applying the appropriate buffer select and byte enable signals to 
the output multiplexor, the datapath state machine will then merge both the read 
and write data via the output multiplexor as it is written to memory. This not only 
provides a simple and efficient means for generating ECC for a read-modify-write 

20 operation, but also ensures that an uncorrectable double bit error wUl not be 
produced in the process, thereby increasing system reliability. 

Accordingly, through the use of a hardware scrubbing mechanism comprising 
a corrected data writeback path to memory, the present invention is able to 
overcome the disadvantages inherent in conventional software-based scrubbing 
25 mechanisms. By utilizing the already existing ECC checking and correcting circuitry 
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and taking advantage of its operation, the invention is able to simplify previous 
ECC scrubbing processes so as to improve over-all system performance and 
reUabiUty while simultaneously reducing the amount of memory bandwidth 
required for memory scrubbing operations. 

It will be appreciated that various modifications and alterations might be 
made by those skilled in the art without departing from the spirit and scope of the 
present invention. Thus, it is understood that the invention is not to be limited by 
the foregoing illustrative details, but rather is to be defined by the appended 
claims. 
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CLAIMS 

1. In a memory controller coupled between a system bus and memory 
and having at least memory control logic and a first data buffer for transferring data 
between memory and the system bus, a method is provided for automatically 
scrubbing ECC errors in memory upon the detection of a correctable error in data 
read from a particular memory location, the method comprising the steps of: 

correcting the erred read data; 

storing the corrected read data in the first data buffer; 

writing the corrected read data stored in the first data buffer onto a writeback 
path, the writeback path having an input end coupled between the first data buffer 
and the system bus and an output end coupled to memory; and 

writing the corrected read data from the writeback paUi to the particular 
memory location in accordance with memory control signals generated by the 
memory control logic. 

2. The method of claim 1, wherein upon detection of a correctable error in 
data read from memory, the method further comprises the step of signaling to the 
memory control logic the existence of the correctable error. 
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3. The method of claim 2, wherein the step of writing the corrected read data 
stored in the first data buffer onto a writeback path comprises the step of transferring 
coiitrol of the first data buffer from a bus agent coupled to the system bus tp the 
memory control logic to control the wriUng of the corrected read data exclusively 
with memory conbrol signals generated by the memory control logic. 



4. The method of claim 2, wherein the memory controller comprises a 
plurality of data buffers to permit simultaneous accesses to memory via 
corresponding memory access requests presented to the memory control logic, and 
the step of writing the corrected read data stored in the first data buffer onto a 
writeback path comprises the steps of: 

associating an error tag with a current memory access request in a request 
queue of the memory control logic upon signaling the existence of the correctable 
error to identify the memory access request associated with erred read data; 

transferring control of the first data buffer from a bus agent coupled to the 
system bus to the memory control logic; and 

controlling the writing of the corrected read data exclusively with memory 
control signals generated by the memory control logic. 



5. The method of claim 4, wherein the step of writing the corrected read data 
stored in the first data buffer onto a writeback path is performed subsequent to other 
memory accesses having higher priority. 



- 21 - 



6. The method of claim 2, wherein the step of writing the corrected read data 
stored in the first data buffer onto a writeback path comprises the steps of: 

providing as a first input to a control multiplexor memory control signals 
generated by a bus agent and transmitted via the system bus; 

providing as a second input to the control multiplexor memory control 
signals generated by the memory control logic; 

selecting as output from the control multiplexor the memory control signals 
generated by the memory control logic upon signaling the existence of the 
correctable error; and 

supplying the output from the control multiplexor as a control/address input 
to the first data buffer to read the corrected data stored in the first data buffer out 
onto the writeback path in accordance with the memory control signals generated by 
the memory control logic. 

7. The method of claim 6, wherein the step of writing the corrected read data 
from the writeback path to the particular memory location comprises the steps of: 

providing as a first input to an output multiplexor write data generated by the 
bus agent and transmitted via the system bus; 

providing as a second input to the output multiplexor the corrected read data 
on the writeback path; .... 
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selecting as output from the output multiplexor the corrected read data by the 
memory control logic upon signaling the existence of the correctable error; and 

writing the selected data output from the output multiplexor to the particular 
memory location. 



8. The method of claim 7, wherein the first data buffer comprises a read data 
buffer for storing data read from memory and the memory controller further 
comprises a write data buffer for storing data to be written to memory, and the step 
of providing as a first input to an output multiplexor write data generated by a bus 
agent ahd transmitted via the system bus comprises the steps of: 

writing the write data generated by a bus agent from the system bus to the 
write data buffer; and 

providing the write data stored in the write buffer as a first input to the 
output multiplexor. 



9. The method of daim 7, wherein the step of signaling to the memory 
control logic the existence of the correctable error is performed by the steps of: 

inputting the data read from memory to an error checking and correcting 
circuit; and 

transmitting from the error checking and correcting circuit to the control logic 
a correctable error detect signal upon detection of the correctable error. 
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10. The method of claim 1, wherein the memory controller comprises a 
plurality of read and write data buffers with each data buffer forming a dual port 
random access memory, the first data buffer forming a read data buffer. 

11. In a memory controller coupled between a system bus and memory 
and having at least memory control logic and a first data buffer for transferring data 
between memory and the system bus, a method is provided for automatically 
scrubbing ECC errors in memory, the method comprising the steps of: 

reading data from memory; . 

inputting the read data to an ECC error checking and correcting circuit to 
check the read data for errors and correct any correctable errors in the read data; 

transmitting from the error checking and correcting circuit to the memory 
control logic a correctable error detect signal upon detection of a correctable error in 
the data read from a particular memory location; 

storing the corrected read data output from the error checking and correcting 
circuit in the first data buffer; 

generating a first source select signal and a second buffer select signal by the 
memory control logic upon receipt of the correctable error detect signal from the 
error checking and correcting circuit; 

transmitting the first source select signal to a control multiplexor having as 
input memory control signals from each of a bus agent via the system bus and the 
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xnemory control logic to select as output from the contirol multiplexor the memory 
control signals input from the memory control logic; 

supplying as a control input to the first data buffer the output from the 
control multiplexor to control writing of the corrected read data stored in the first ! 
data buffer exclusively by the memory control logic; 

writing the corrected read data from the first data buffer onto a writeback path, 
the writeback path being coupled between the output of the first data buffer and a 
first input to an output multiplexor, the output multiplexor having as a second 
input write data provided by a bus agent via the system bus; 

supplying as a control input to the output multiplexor the second buffer select 
signal to select as output from the output multiplexor the corrected read data; and. 

writing the corrected read data back to the particular memory location in 
accordance with memory control signals generated by the memory control logic. - 



12. The method of claim 11, wherein the memory controller comprises a 
plurality of data buffers to permit simultaneous accesses to memory via 
corresponding memory access requests presented to the memory control logic, and 
the step of writmg the corrected read data from the first data buffer onto a writeback 
path is performed by the step of associating an error tag with a current memory 
access request in a request queue of the memory control logic upon receipt of the 
correctable error detect signal from the error checking and correcting circuit. 
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13. The method of claim 12, wherein the step of writing the corrected read 
data from the first data buffer onto the writeback path is performed subsequent to 
other memory accesses having higher priority. 

14. The method of claim 11, wherein the first data buffer comprises a read 
data buffer for storing data read from memory and the memory controller further 
comprises a write data buffer for storing data to be written to memory, and the 
method further comprises the steps of: 

writing the write data provided by a bus agent from the system bus to the 
write data buffer; and 

providing the write data stored in the write buffer as the second input to ti^e 
output multiplexor. 

15. The method of claim 11, wherein the memory controller comprises a 
plurality of read and write data buffers with each data buffer forming a dual port 
random access memory, the first data buffer forming a read data buffer. 

16. The method of claim 11, wherein the step of reading data from memory is 
performed by sequentially reading each location in memory in the performance of 
periodic scrubbing, the reading of memory locations being controlled by memory 
control signals generated by the memory control logic. 
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17. In a computer system comprising a microprocessor, a system bus for 
coupling the microprocessor to memory and to a plurality of bus agents comprising 
at least input/output devices, and a memory controller having at least memory 
control logic for controlling accesses to memory, an apparatus is provided in the 
memory controller for automatically scrubbing ECC errors in memory upon the 
detection of a correctable error in data read from memory, the apparatus comprising: 

an ECC error checking and correcting unit for checking data read from 
memory for errors and correcting any correctable errors found in the read data, the 
ECC error checking and correcting unit signaling to the memory control logic the 
existence of a correctable error in the data read from a particular memory location 
upon detection of the correctable error; 

a first data buffer for storing the read data output from the ECC error checking 
and correcting urut; and 

a writeback path having an input end coupled to an output of the first data 
buffer and an output end coupled to memory for writing the corrected data from the 
first buffer to memory in accordance with memory control signals generated by the 
memory control logic in response to signaling the existence of a correctable error in 
the data read from a particular memory location. 



18. The apparatus of claim 17, wherein the apparatus further comprises a 
control multiplexor having as a first input memory control signals generated by the 
memory control logic, as a second input memory control signals generated by one of 
the microprocessor and a bus agent and as a third input a first select signal from the 
memory control logic for selecting as output one of the first and second inputs for 
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controlling the first data buffer in accordance with the particular memory control 
signals selected, the memory control logic asserting the first select signal upon 
signaling of the existence of a correctable error to cause the writing of the corrected 
read data stored in the first data buffer to be controlled exclusively by the memory 
control logic. 



19. The apparatus of claim 17/ wherein the apparatus further comprises an 
output multiplexor having as a first input the corrected read data on the writeback 
path, as a second input write data provided by one of the microprocessor and a bus 
agent and as a third input a sejcond select signal from the memory control logic for 
selecting as output one of the first and second inputs, the memory controller 
asserting the second select signal upon signaling of the existence of a correctable 
error to enable writing of the corrected read data placed on the writeback path to 
memory. 

20. The apparatus of claim 17, wherein the apparatus further comprises an 
output multiplexor having as a first input the corrected read data on the writeback 
path, as a second input write data provided by one of the. microprocessor and a bus 
agent and as a third input byte enable signals from the memory control logic for 
selectively merging partial write data input to the multiplexor from one of the 
microprocessor and a bus agent with the corrected read data stored in the first buffer 
in the performance of a read-modify-write operation. 
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21 The apparatus of claim 20, wherein the apparatus further comprises an 
ECC code word generation unit coupled to the output of a>e control multiplexor for 
encoding the merged data output from the conu^ multiplexor before it is written 

back to memory. 



22 The apparatus of claim 19, wherein the first data buffer comprises a read 
data buffer for storing data read from memory and the memory controller further 
comprises a write data buffer for storing data to be written to memory, with the read 
data stored in the read data buffer forming the first input to the output multiplexor 
and the write data stored in the write data buffer forming the second input to the 
output multiplexor. 



23 The apparatus of claim 17, wherein the memory controller further 
comprises a memory access request queue and a plurality of data buffers to permit 
simultaneous accesses to memory via corresponding memory access requests 
presented to the memory control logic and buffered in the request queue, the 
„.emory control logic associating an error tag with a memory access request buffered 
in the request queue identified as being the memory access request associated w.th 
the erred read data. 



24. The apparatus of claim 17, wherein the memory control logic initiates 
sequential reading of each location in memory in the performance of periodic 
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scrubbing, the reading of memory locations being controlled by memory control 
signals generated by the memory control logic. 
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